Multilingual thesauri and ontologies in cross-language retrieval
نویسنده
چکیده
This paper sets forth a framework for the use of thesauri and ontoiogies as knowledge bases in cross-language r trieval. It provides a general introduction to thesaurus functions, structure, and construction withparticular attention to the problems of multilingual thesauri. It proposes the creation of environments for distributed collaborative knowledge base development as a way to make large-scale knowledge-based ystems more affordable.
منابع مشابه
Cross-Language Information Retrieval in a Multilingual Legal Domain
We describe here the application of a cross-language information retrieval technique based on similarity thesauri in the domain of Swiss law. We present the theory of similarity thesauri, which are information structures deerived from corpora, and show how they can be used for cross-language retrieval. We also discuss the collections of Swiss legal documents and show how we have used them to co...
متن کاملAutomatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval
OBJECTIVES We present in this article experiments on multi-language information extraction and access in the medical domain. For such applications, multilingual terminology plays a crucial role when working on specialized languages and specific domains. MATERIAL AND METHODS We propose firstly a method for enriching multilingual thesauri which extracts new terms from parallel corpora, and seco...
متن کاملProviding Cross-Lingual Information Access with Knowledge-Poor Methods
We are proposing a simple, but efficient approach for a number of multilingual and cross-lingual language technology applications that are not limited to the usual two or three languages, but that can be applied with relatively little effort to larger sets of languages. The approach consists of using existing multilingual linguistic resources such as thesauri, nomenclatures and gazetteers, as w...
متن کاملThe Eurospider Retrieval System and the TREC-8 Cross-Language Track
This year the Eurospider team, with help from Columbia, focused on trying different combinations of translation approaches. We investigated the use and integration of pseudo-relevance feedback, multilingual similarity thesauri and machine translation. We also looked at different ways of merging individual crosslanguage retrieval runs to produce multilingual result lists. We participated in both...
متن کاملAligning Multilingual Thesauri
The aligning and merging of ontologies with overlapping information are actual one of the most active domain of investigation in the Semantic Web community. Multilingual lexical ontologies thesauri are fundamental knowledge sources for most NLP projects addressing multilinguality. The alignment of multilingual lexical knowledge sources has various applications ranging from knowledge acquisition...
متن کامل